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Abstract 


Weighted  integrals  of  random  processes  are  approximated  by  the  trape^idal  rule  based  on  a 
stratified  and  symmetrized  random  sample  of  size  n.  The  weight  functions  jlre  assumed  to  be  twice 
continuously  differentiable.  We  consider  the  rate  of  convergence  to  zero  of  the  mean-square  integral 
approximation  error  as  the  sample  size  increases  indefinitely.  For  randoiry  processes  which  are  twice 

mean-square  continuously  differentiable  it  is  shown  that  the  rate  is  /r  ,  just  as  without  a  random 

'  . 

component  | (Haber  {2} J?  For  random  processes  which  are  a  bit  more  than  once,  but  not  twice,  mean- 
square  continuously  differentiable  the  rate  is  shown  to  be  n^.  In  both  cases  the  asymptotic  constant  is 
also  determined. 
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1.  Introduction,  Results,  and  Discussion 

We  consider  the  numerical  approximation  of  the  integral 

t 

I(f)  =  jf(t)dt  (1.1) 

o 


of  a  function  /  over  a  finite  interval.  The  simple  Monte  Carlo  numerical  approximation  of  /  takes  the 
form 

n  i*  1 

where  U  i ,  •  •  •  ,  Un  are  independent  random  variables  each  with  a  uniform  distribution  over  the  interval 
[0,  1].  When  /is  square  integrable  the  mean-square  error  is  given  by 


£[/(/) -40)</)]2  = 1  {Hf) ~  U(f)]2) 

n 

and  the  rate  of  n~l  cannot  be  improved  by  imposing  additional  smoothness  assumptions  on/. 

Haber  [1]  introduced  a  stratified  sampling  scheme  whereby  the  interval  [0,  1]  is  partitioned  into  n 
subintervals  An<i,  i  =  1,  •  ,n,  of  equal  length  and  a  point  Un  i  is  chosen  at  random,  i.e.,  uniformly 
distributed,  in  An  i  (the  Un/s  being  independent  for  each  n  ).  Then  the  stratified  Monte  Carlo 
approximation  of  the  integral  Iff)  is 


41}(/)=  7  £/«/.,,) • 

n  1 

When  /has  a  continuous  derivative  on  [0, 1]  the  rate  of  quadratic-mean  convergence  is  n-3  [1], 


lim  n3  E[I(f)-jM(f)]2  =  -±-  j  \f\t)]2dt , 

and  this  rate  cannot  be  improved  by  imposing  further  smoothness  requirements  on  /.  In  order  to  obtain  a 
faster  rate  of  convergence  when  /  has  a  continuous  second  derivative  on  [0, 1],  Haber  [2]  adopted  the 


>r 


>n _ J 


antithetic  variates  method  and  considered  the  following  stratified  and  symmetrized  scheme  where  along 


V 


with  each  U„ >(  its  antithetic  point  (i.e.,  the  symmetrically  opposite  point  to  Unj  in  An  i)  is  used.  The  f  y  Codej 

and/or 

stratified  and  symmetrized  Monte  Carlo  approximation  of  the  integral  1(f)  is  j  i  at  lal 
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iu(f) = 1  i  T  +/(tr».1)] .  (1.2) 

n  j»|  * 

If/has  a  continuous  second  derivative  on  [0,1]  then  the  rate  of  convergence  is  n~s  and 

,  i 

]im  (2n)sE [I (f )- I^(f )]2  =  \  \f'(t))2  dt .  (1.3) 

n—**>  45  o 

In  this  paper  we  consider  weighted  integrals  of  random  processes  and  establish  the  rate  of 
quadratic-mean  convergence  and  the  asymptotic  constant  for  estimates  of  the  form  (1.2),  allowing  for 
nonuniform  partitions.  Throughout  this  paper  X  =  {X(r,t»),  OS  t  £  1  }  is  a  measurable  second-order 
random  process  with  mean  zero,  E  [X (/)]  -  0,  and  covariance  function  R  ( t,s )  =  E[X (t)X (j)],  defined  on  a 
probability  space  (Cl,F,  P).  We  shall  be  concerned  with  the  numerical  approximation  of  the  integral 

i 

l(fl 0=  ]f(t)X(t)dt  (1.4) 

o 

l 

which  exists  as  a  sample  path  integral  whenever  J  If  (01  R*(t,  Odt  <°°.  (We  suppress  the  probability 

o 

variable  eo  and  write  X(0  for  X  (t,  to)).  Integrals  of  the  form  (1.4)  are  common  in  detection  and 
estimation  problems.  Unlike  (1.2)  we  allow  the  partition  of  the  interval  [0, 1]  to  be 

nonequally-spaced  and  we  adopt  "regular"  partitions  0  =  f*,  o  <  l  <  '  “  <  *n,n  =  1  defined  by  means  of 
continuous,  strictly  positive,  probability  density  function  h(0  on  [0, 1]  as  follows: 

Vi  , 

jh(0dt  =  —  ;  i*0,  1,  n.  (1.5) 

o  n 

We  set 


=  (f^j-i  , !«,»)»  •  - 1*,/  ~  t»,i- )  »  «  =  1,  ■  •  •  ,  n  .  (1.6) 

When  h(t)-\  we  obtain  a  uniform  partition  of  [0, 1].  It  will  be  seen  subsequently  that  the  quality  of  the 
approximation  can  be  improved  by  tailoring  the  density  h  (t)  to  the  covariance  R  ( t ,  j)  of  the  process  X. 


We  assume  that  for  each  n  £  1, 
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i)  U^i  is  uniformly  distributed  oveM*.,,  i  =  1,  •  •  •  ,  n, 

ii)  t/*,  i ,  •  •  •  ,  UH>H  arc  independent, 

iii)  {£/„_,  }  ",i  is  independent  of  the  process  X. 

We  denote  the  antithetical  point  of  U^t  by 

(1.7) 

where  c„t,  is  the  midpoint  of  An  i, 

=  y  +  Am).  (1*8) 


The  stratified  and  symmetrized  Monte  Carlo  approximation  of  the  integral  I(fX)  of  (1.4)  is  now  defined 
as 


/2-(/X)  =  |£ 

z  i*t 


f(.uHti)X(uH,i)+f(.u'n>iyx(u'Hti ) 


(•A/*,,- 


0.9) 


and  is  in  fact  a  trapezoidal  rule.  We  first  establish  an  expression  for  the  quadratic-mean  approximation 
error  under  general  conditions.  This  is  useful  for  evaluating  finite  sample  size  performance  and  for 
studying  the  asymptotic  convergence  properties. 

t 

THEOREM  1,  If  J  f2(t)R  (r,r)  dt  <<*>  then  for  all  n  £  1  we  have 
o 


[f\t)R  (t,  t )  +f(t)R  (t,  2 cm,i  -  t)f  (2c„'i  -  0)  dt 


-J  j  f(t)R(t,s)f(s)dtds 

-Vi  -Vi 


(1.10) 


We  next  show  that  when  the  function  /  has  a  continuous  second  derivative  and  the  process  X  has 
essentially  one  (but  not  two)  quadratic-mean  derivative  which  is  mean-square  continuous,  then  the  rate  of 
convergence  of  the  quadratic-mean  integral  approximation  error  is  precisely  n-4  ( not  n~5  !).  Specifically 
we  make  the  following  assumption. 
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ASSUMPTION  A. 

i)  /  has  a  continuous  second  derivative  on  [0, 1  ]. 

ii)  The  covariance  function  R  (t,  s)  of  the  process  X  has  continuous  mixed  derivatives  /?*•'(*,  s)  of 
order  2,  0  £  k  +j  £  2,  on  the  unit  square  [0, 1]  x  [0. 1];  and  of  order  3,  it  +j  =  3,  off  its 
diagonal  with  finite  one-sided  limits  at  the  diagonal  which  are  continuous  along  the  diagonal. 

iii)  The  function  r(t)-R  (t,  t )  has  a  continuous  third  derivative  on  [0,1]. 

The  assumption  of  continuous  mixed  derivatives  of  R  of  order  up  to  2  on  [0, 1]  x  [0, 1]  is  equivalent 
to  the  assumption  that  the  processes  X  has  one  mean-square  continuous  quadratic-mean  derivative.  The 
additional  assumption  of  differentiability  of  order  3  off  the  diagonal  is  weak  and  is  always  satisfied  when, 
for  example,  X  is  stationary,  has  rational  spectral  density,  and  exactly  one  quadratic-mean  derivative.  The 
smoothness  assumption  on  r  (r)  is  very  weak  and  is  always  satisfied  in  the  stationary  case.  With 

Raj(,t,t)=  lim  R^(u,v) ,  R&(t,  r)  =  lim  RKj  (u,v) ,  (1.11) 

-*•'>  ^vj-Kw) 

the  one-sided  limits  of  the  derivatives  of  R  above  and  below  the  diagonal,  respectively,  we  set 

fitj(0-ti/(i.0-RP(t,t)  (1.12) 

which  exist  under  Assumption  A  for  k+j  =  3.  We  can  now  state  one  of  our  main  results. 

THEOREM  2.  Under  Assumption  A  we  have 

]  2 

lim  (2/»)4  E[I(JX)-l2,,(fX))2  =  j  [303.0(0 +  7/J2.,(')]<fc  •  0-13) 

«-»-  120  J0  h*(t) 

It  is  seen  that  the  rate  of  quadratic-mean  convergence  is  precisely  n-4,  provided  303,0(0  +  702,i(O 
is  not  identically  zero,  and  cannot  be  improved  by  additional  smoothness  of  /.  In  case  the  third  order 
mixed  derivatives  of  R  are  continuous  at  the  diagonal,  the  asymptotic  constant  in  (1.13)  is  zero  and  the 
mean-square  approximation  error  is  o(n~ 4).  The  asymptotic  constant  in  Theorem  2  depends  on  the 
density  function  h  of  the  regular  partition.  The  optimal  density  h*  which  minimizes  the  asymptotic 
constant  in  (1 . 1 3)  is  given  by 
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{/2«[303.o(O  +  702.i(O]}2/5 
i 

J  [f2(u)[3^0(u)  +  ip2A(u)}}2/5  du 


for  which  (1.13)  becomes 


(U4) 


Um(2n)4£[/(/X)-/^0X)]2 

#1— » oo 


1 

120 


1  r 


J  /2(O[303.o(O  +  702.i(r)] 

0  L 


2/5 


dt 


When  the  process  X  is  weakly-stationary,  R  ( t ,  s)-R{t-  s).  Assumption  A  simplifies  to 
ASSUMPTION  A'  (STATIONARY  CASE). 

i)  /  has  a  continuous  second  derivative  on  [0, 1]. 

ii)  The  covariance  function  R  (t)  has  a  continuous  derivative  of  order  2  on  the  entire  real  line  and 
of  order  3  away  from  the  origin  with  finite  one-sided  limits  R®\± 0). 

With 

&  4/?(3)(0+)-/?(3>(0-)^0,  (1.15) 

Theorem  2  becomes 

COROLLARY  1,  When  the  process  X  is  stationary,  under  Assumption  A\  we  have 

lim  (2/t)4  £[/(/*) -/2*(/X)]2  =  ^ -  J  dt .  (1.16) 

30  5  h*(t) 


It  is  seen  that  if  R®\t)  is  discontinuous  at  the  origin,  the  quadratic- mean  convergence  rate  is 
precisely  n"4;  if  R(3)(r)  is  continuous  at  the  origin  then  03  =0  and  the  mean-square  approximation  error 
is  o(n~*).  The  asymptotically  optimal  density  is  now  given  by 

>•(»=,  ■/»>'"  . 

o 

for  which  (1.16)  becomes 


lim (2n)A  E[I(JX)~ 4(/Y)]2  =  \ Jl/  (0 1 2/5  dt] 
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It  is  clear  from  Theorem  2  that  for  the  approximation  of  weighted  integrals  of  random  processes  we 
obtain  a  quadratic-mean  convergence  rate  of  n~*  when  the  weight  has  two  continuous  derivatives  and  the 
process  has  essentially  one  but  not  quite  two  mean-square  continuous  quadratic-mean  derivatives.  We 
now  show  that  under  an  additional  smoothness  condition  on  the  covariance  function  R  (t,  s )  of  the  process 
X,  we  can  obtain  a  rate  of  n~s  for  weighted  integrals  of  random  processes.  To  this  end  we  set 

ASSUMPTION  B. 

i)  /  has  a  continuous  second  derivative  on  [0, 1]. 

ii)  The  covariance  function  R  (t,  s )  of  the  process  X  has  continuous  mixed  derivatives  Rk'J(t,  s )  of 
order  4, 0  £  k  +  j  £  4,  on  the  unit  square  [0, 1  ]  x  [0, 1]. 

Part  (ii)  of  Assumption  B  is  equivalent  to  the  assumption  that  the  process  X  has  two  mean-square 
continuous  quadratic-mean  derivatives.  We  then  have  our  second  principal  result. 


THEOREM  3.  Under  Assumption  B  we  have 

1  j 

lim  (2n)5  E  [/  (fX)  —  / 2*(POJ2  =  j  TJTT"  &  0-17) 

a— *«•  43  q  n  (r) 

where 

A  2(f)  =  R  (t,  t)[f"(t)]2  +4  R 1,0 (r, 

+  4  yK2-°(f.  +  t)(f(t))2  +  4  R^'it,  R2,2(t, 

=  Elt/COXWn2  (1.18) 


and  differentiation  of  the  process  X  is  meant  in  quadratic-mean. 

Since  A2(t)  cannot  be  identically  zero,  the  rate  of  quadratic-mean  convergence  is  precisely  n~5  and 
cannot  be  improved  by  additional  smoothness  of  /  or  R.  As  in  the  discussion  following  Theorem  2,  we 
can  select  the  partitioning  density  h  so  as  to  minimize  the  asymptotic  constant  in  Theorem  3.  We  obtain 


»-w.  ,-MWg. 
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for  which  (1.17)  becomes 

lim(2«)5  E[I  (JX)  -  l2j,(fX)]2  =  -j-i  f  |  A  (f)  | 2/6  dt  -  . 
lt-4~  45  I  q 

We  now  specialize  Theorem  3  to  the  stationary  case.  Here  Assumption  B  simplies  to  the  following. 
ASSUMPTION  B'  (STATIONARY  CASE). 

i)  /  has  a  continuous  second  derivative  on  [0, 1]. 

ii)  The  covariance  function  R  ( t )  has  a  continuous  derivative  of  order  4. 

We  then  have 

COROLLARY  2.  When  the  process  X  is  stationary,  under  Assumption  B\  we  have 

1  a2( 

lim  (2 n)s  E  [/  (fX)  -  /2,(/X)]2  =  J  dt  (1.19) 

«-»-  as  o  h  (r) 

where 

A  2(r)  =  R(Q)[f"(t)]2  +  2[  -  R"(0)I  {2(f'(f)I2 -/(r)/"(r))  +  Rw(0)f2(t) 

=  E  {[/(r)X(r)]"}2.  (1.20) 

The  asymptotically  optimal  partitioning  density  is  now  given  by 

A*(0  =  T 

j[A\u)}mdu 

o 

In  the  stratified  and  symmetrized  Monte  Carlo  approximation  considered  in  this  paper,  the  randomly 
chosen  points  { U„  i )  are  uniformly  distributed  within  each  subinterval  of  the  regular  partition.  On  the 
other  hand  one  may  wish  to  retain  the  property  of  the  crude  Monte  Carlo  whereby  the  randomly  chosen 
points  {(/„  ,  )",!  are  uniformly  distributed  over  the  domain  of  integration  [0, 1].  Such  an  approach  leads 
to  trapezoidal  Monte  Carlo  integration  which  was  considered  in  Yakowitz  et  al.  [4]  for  integration  of 
(deterministic)  functions  /  and  in  Masry  and  Cambanis  [3]  for  weighted  integrals  of  random  processes.  It 


-9- 


may  be  of  interest  to  provide  a  comparison  of  the  performance  of  these  two  integral  approximation 
schemes  (both  of  which  use  trapezoidal  rules)  under  identical  assumptions  on  the  integrands.  For  the 
trapezoidal  Monte  Carlo  approximation  we  use  independent  random  variables  U\,  •  •  •  ,Un  uniformly 
distributed  on  [0, 1],  independent  of  X,  and  we  let  t^o  =0  <  th,\  <  tn.2  <  *  •  •  <  <  1  4rn  n+1  be 

the  corresponding  ordered  sample.  The  integral  (1.4)  is  approximated  by 

Q&fJX)  =  j  £  \f(t^yX(t „,.)  +/(T,.i+i)X(T^+1)]  (Tnil>1  —TH'j)  .  (1.21) 

For  simplicity  we  state  below  the  convergence  properties  of  in  the  stationary  case  only.  Under 
Assumption  A’  we  have  [3] 

i  1  1 

lim  n*E  [I(JX)  -  ffflVX)]2  =  703  +  -jE[(fX)X  D  ~  (/X)'(0)]2  =  Ctrap  . 

n — ►“>  4  q  ** 

In  order  to  compare  this  to  Corollary  1,  we  assume  even  sample  size  N  so  that 

Hm  4  E^IifX)  -  /)vwP>  (/X)]2  =  Ctnp 
whereas  by  Corollary  1  with  h  (/)  =  1  we  have 

lim  N*E\im-lN(fX)Y  =  %  jf\t)dt  =  CiXl. 

L  J  JU  q 

It  is  clear  that,  while  the  symmetric- stratified  and  the  trapezoidal  Monte  Carlo  approximations  have 
identical  rates  of  quadratic-mean  convergence,  their  corresponding  asymptotic  constants  satisfy 
(C imp  /  CM)V*  >  (45/2) 1/4  =2.18  and  thus,  asymptotically,  for  the  same  accuracy  measured  in  teims  of 
quadratic-mean  error,  more  than  twice  as  many  samples  are  required  for  the  trapezoidal  scheme.  This 
discrepancy  also  appears  in  the  example  below  where  the  finite  sample  size  performance  is  evaluated. 

Finally,  it  may  be  interest  to  examine  the  performance  of  the  strati fied-symmetri zed  Monte  Carlo 
integral  approximation  when  the  function  /  and  the  process  X  satisfy  weaker  smoothness  conditions  than 
those  stated  earlier.  For  integrals  of  random  process  l(fX)  we  assume  for  simplicity  that  /  3  1  and  X  is 
wide-sense  stationary  process.  The  following  table  of  quadratic-mean  convergence  rates  complements 
Theorems  2  and  3.  The  additional  rates  displayed  in  the  table  can  be  established  in  the  manner  of  the 
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proofs  of  Theorems  2  and  3. 


/*(/) 

/*(*> 

Smoothness 

Rate 

Smoothness 

Rate 

/continuous 

o(n~l ) 

R  continuous 

o(n~l) 

R'  continuous 

o(n~2) 

/'  continuous 

R"  continuous 

o(n~ 3) 

R "  continuous  & 

n~* 

R(3)(0±)  finite,  * 0 

/"  continuous 

n“5 

/?(4)  continuous 

n~5 

Recall  that  a  wide-sense  stationary  process  X  has  k  mean-square  continuous  quadratic-mean 
derivatives  if  and  only  if  is  continuous.  It  is  then  seen  from  the  table  that  when  the  nonrandom 
function /or  the  stationary  process  X  have  0,  1,  or  2  derivatives,  usual  or  quadratic -mean  respectively, 
the  rates  of  convergence  of  the  mean-square  approximation  error  of  their  integrals  an?  identical.  For  the 
approximation  I^ifX)  of  I  (JX)  with  mixed  smoothness  conditions  on  /and  on  X,  it  can  be  shown  that  the 
slower  rate  prevails.  Thus,  for  example,  if  /'  is  continuous  and  R(4)  is  continuous,  the  rate  of 
convergence  of  the  mean-square  approximation  is  o(n~3). 

Thus  the  ultimate  rate  of  convergence,  n~5,  of  the  symmetric-strati fied  Monte  Carlo  approximation 
of  /  ( fX )  is  achieved  when  the  nonrandom  function  /  has  two  continuous  derivatives  and  the  random 
process  X  has  two  mean-square  continuous  quadratic-mean  derivatives;  i.e.  when  the  usual  smoothness  of 
/and  the  quadratic-mean  smoothness  of  X  arc  comparable.  This  is  in  contrast  with  the  trapezoidal  Monte 
Carlo  approximation  of  l(fX)  whose  ultimate  rate,  n-4,  is  achieved  when  the  nonrandom  function  / has 
two  continuous  derivatives  and  the  random  process  X  has  one  mean-square  continuous  quadratic-mean 
derivative  and  continuous  mixed  partial  derivatives  of  R  of  order  3  off  the  diagonal,  but  not  two 
quadratic-mean  derivatives;  i.e.,  when  the  quadratic-mean  smoothness  of  X  is  less  than  the  usual 
smoothness  of/! 
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EXAMPLE.  We  illustrate  via  an  example  the  finite  sample  size  performance  of  the  strati  fied- 
symmetrized  Monte  Carlo  approximation  and  compare  it  to  that  of  the  trapezoidal  Monte  Carlo 
approximation.  We  consider  a  stationary  process  X  with  mean  zero  and  covariance  function 

/?(0  =  0+y|r|)«_yl'1 


where  y  >  0.  Note  that  /?  (0)  =  1  and  the  process  has  exactly  one  quadratic  mean  derivative.  Moreover, 
/?(3)(r)  is  discontinuous  at  the  origin  so  that  Corollary  1  is  applicable  and  the  rate  of  mean- square  error 
convergence  of  IjnifX)  is  n-4.  For  simplicity  we  take  f(t)s  1  so  that  the  integral  to  be  estimated  is 

l 

l(X)  =  jX(t)dt.  We  choose  an  equally-spaced  partition,  h(t) si,  for  which  the  approximation  (1.9) 


becomes 


l2n(X)  =  £  jx  ((/„.,•)  +  X(tr„,1) 


The  variance  a2  of /(X)  is  given  by 


l  l 

//■ 

oo 


2 

Y 


•  ' 

i  +  i 

e  y 

1  r 

r 

4 

From  Theorem  1  we  find  after  some  algebra,  that  the  mean-square  error  is  given  by 
E[I(X)-1 2*(X)]2  = 

The  asymptotic  constant  C,u  is  given  by 


*  ' 

1  6n 

\  1  +  e~rln ' 

2/i  v2 

».  9  j 

l  J  y 

l  J 

C  JU.  — 


=  _zL 

120 


Let  N  =•  2n  =  2, 4,  •  •  •  ,  be  the  (true)  sample  size  with  corresponding  mean-square  error 
mse  (N)  =  E  [/  (X)  -  /m(X)]2.  The  fractional  mean-square  error  is  then  given  by  mse(N)/o2.  In  order  to 
select  appropriate  values  of  y  for  numerical  display  of  the  finite  sample  size  performance,  the  behavior  of 
the  fractional  error  mse( 2)1  a2  (  based  on  2  samples)  as  a  function  of  y  was  investigated.  Table  1  below 
lists  the  results  along  with  the  value  of  the  asymptotic  constant. 
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r 

mse(2)lo2 

Cftr 

.2 

6.055  xlO-5 

6.66  xir5 

1 

5.456  xlO-3 

8.33  xlOr3 

3 

8.02  xlO-2 

.225 

5 

.2321 

1.04 

■ 

.4293 

2.858 

.7646 

8.333 

15 

1.361 

28.125 

20 

1.973 

66.666 

We  select  two  values  y  =  5  and  y  =  10  corresponding  to  moderate  values  of  mse  (2)/cr2. 

In  Figure  1  the  fractional  mean-square  error  mse(N)la2  is  plotted  as  a  function  of  the  sample  size 
N  =  2,  4,  6,  •  •  ■  ,  30  for  y  =  5  and  y  =  10.  It  is  seen  that  for  the  smaller  value  of  y  =  5,  the  fractional  error 
is  considerably  smaller  for  each  sample  size  N.  This  can  be  explained  by  the  less  rapid  decay  of  R  (/)  and 
hence  the  larger  correlation  between  consecutive  samples  so  that  In(X)  provides  a  better  estimate  of  /  (X) 
in  this  case.  The  closeness  of  the  fractional  mean-square  error  to  its  asymptotic  value, 

,  C,u/a2 

mse(N)lo2 - r —  , 

Na 

is  displayed  in  Figure  2  for  parameter  y  =  5.  Note  that  the  asymptotic  value  overestimates  the  true  error 
for  all  sample  sizes  N  in  the  plotted  range.  Naturally,  the  discrepancy  between  the  two  values  diminishes 
as  N  increases. 

It  may  be  of  interest  to  compare  the  above  finite  sample  size  performance  to  that  of  the  trapezoidal 
Monte  Carlo  approximation  (1.21).  For  the  latter  approximation  the  expression  for  the  mean-square  error 
for  a  finite  sample  size  is  given  in  [3,  Eq.  (1.13)].  In  Figure  3,  the  fractional  mean-square  errors 
mse(N)la2  are  plotted  as  functions  of  the  sample  size  N  =  2,  4,  •  •  •  ,  30  for  y  =  5.  It  is  seen  that  the 
symmetric-strati fied  approximation  outperforms  the  trapezoidal  Monte  Carlo  approximation  by  a  wide 
margin  for  all  sample  sizes  N  in  the  plotted  range.  For  1%  fractional  mean-square  error  6  samples  arc 
required  for  the  symmetric-strati  fied  approximation  but  12  samples  for  the  trapezoidal  Monte  Carlo 
approximation;  for  .1%  fractional  mean-square  error,  the  corresponding  numbers  of  samples  are  12  and 
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24  respectively.  Asymptotically,  for  large  N,  we  can  compare  instead  the  asymptotic  constants 

Cwp  =  {l+6y  +  (y-l)«-i,}y2/2,  Cttr=y3/ 120 
and  it  is  seen  that  for  all  y  >  0, 

^inp^itr  ^  360 

so  that,  for  the  same  mean-square  error,  the  trapezoidal  Monte  Carlo  approximation  requires  a  sample 
size  N  greater  than  that  of  the  symmetric-strati  fled  Monte  Carlo  approximation  by  a  factor  of  at  least 
(360)1/4  =  4.35.  When  y  =  5,  [C^lC„]Vt  =  (372.32)*  =  4.393. 

2.  Derivations 

In  order  to  simplify  the  writing  throughout  this  section  we  will  drop  the  subscript  n  from 

An ,i*  Ar„,(-  ,  Cnj,  j . 

PROOF  OF  THEOREM  1.  The  expectation  in  E[l(fX)-l2n(JX)]2  is  with  respect  to  both  the 
random  samples  {l/,  }"=i  and  the  random  process  (X(r),  0  S  r  £  1}  which  are  mutually  independent.  We 
first  verify  that  both  /  (fX)  and  l2 A(JX)  have  finite  second  moments.  To  simplify  the  notation  we  put 
Y{t)-f  (t)X (r)  and  M(t,s)=f(t)R  (r,  s)f  (s).  It  follows  that 

i  u  i 

E  {|/(T)|2}^£{ \E{  |  K(r)  |  dt  }2  =  ||£{  |  T  (t)Y  (s)  |  }  dtds  <  ( J  Af  *(r,  r) dr  )2  <  <» 

0  00  0 

where  we  used  E2{  |  K(r)T(s)| }  2  E{Y2(t))  E  (y2(s)}  =M(t,  t)M(s,  s ) .  The  more  restrictive  condition 

i 

jAf(r,  t)dt  <~  is  needed  for  the  finiteness  of  the  second  moment  of  /^(K).  Indeed,  taking  first  the 
o 

expectation  with  respect  to  the  random  samples,  we  find  for  each  i, 

£{y2(I/()}  =E  j-^-  f  r2(r)drj=-^-  Ja f(r,  t)dt<  ~. 

These  inequalities  justify  the  interchanging  of  integrals  and  expectations  below.  In  view  of  (1.1) 
and  (1.9)  the  integral  approximation  error  can  be  written  as 
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t(Y)  -  lu(Y)  =  £  |  j  Y(t)dt  -  \a tiirm  +  YQCi  -  Ui)]  U  £  «i . 

i-1  (A  1  j  i-1 


The  bias  of  the  i*  error  term  is 


£{e,}=  £{£(e,-|X)}=  £^ 


J  y  (t)dt  -  y  J  [Y(t)  +  T (2c,  - 1)]  dt  l  =  E  {0}  =  0 
A  1  a,  J 


Thus  / 2*  is  an  unbiased  estimator  of  I: 


E[l(Y)-lu(Y))=  0. 


Since  given  the  random  process  X,  the  error  terms  { e, }  f,i  are  independent  with  zero  mean  we  have  for 

i  *j. 


E{eiej)  =  £{£(«,-«;  \  X))=E{E{ei\X)E{ej  |  X)}=£(0)  =  0 


It  follows  that 


mse2n  k E [/  (X)  -  1U(Y)}2  =  E  ]  £  e,4  =  £  Z  E  {e,e, }  =  £  £  {e? } 


!.« -1  J  i«l  y-1 

Performing  first  the  expectation  with  respect  to  the  samples  we  find 

r  ]2 

£{«?}=£  j  J  y (OA - 1  A/1[r(l/i)  +  K(2c, -t/,)]  • 


i-l 


=  £ 


jY(t)dt 

l 


-  At,  j Y(t)dt  [Y (Ui)  +  y(2c,  -  {/,)]  +  j  A tf[Ym  +  y  (2c,-  -  Uj)]2 


=  £ 


J  Y(t)dt 


V-I  J 


-  J  Y(t)dt  J[y(j) + y(2c,  -  «)]<&  +  j  a ti  j  [y(o  +  y(2c,  -  oi 

At  A  4  -A; 


dt 


=  J  J  M(t,  s)  dtds-  J  |  [A/(t,  s )  +  A/(t.  2c,  -  s)]  dtds 

A,  A,  A,  A, 


+  -7  At,  f  [M  (t,  t)  +  2M  (f,  2c,  -t)  +  M  (2c,  -  r,  2c,  -  f)]  dt 
4  a. 
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[M{t,  t)+M(t,  2c,  M(t,  s)  dtds  . 


and  (1.10)  follows  by  sumraatioa  □ 

PROOF  OF  THEOREM  2.  We  first  Taylor-expand  R(u)  for  (r,  s)  off  the  diagonal  of  A,xA,-  (i.e., 
t  *s)  about  the  center  (c,,  c,).  From  Assumption  A.  («)  we  have 

R  (r,  5)  =  R(c„  c.)  +  (r-  c.)  /?  1-°(c,  ,a)  +  (s-  c,)  R0,1  (c„  c,) 

+  y(f  -  c,)2  R2’0(c,i  c.)  +  y  (s  -  c,)2  R0,2(c,-,  c,)  +  (t  -  c,)(s  -  c.)  R 1,1  (c„  c.) 

+  “  c<)3  /?3'°( 1111 ) +  “  C')2(J  ~  c«)rt2,1(  mt )  (21) 

0  * 

+  4-(r  -  CiXs  -  c,)2  R  1,2(  int )  +  -j(s  -  c,)3  R0,3(  int ) 

2  6 


where  int  is  a  point  in  the  open  line  segment  determined  by  (r,  s)  and  (c„  c,)  (depending  of  course  on 
both).  We  also  Taylor-expand  the  function  r(r)  =  R(r,  f)  about  c,.  In  view  of  Assumption  A.(U)  -  (»0 
we  have  /(r)  =  2  R  '’°(r,  r),  r"(r)  =  2[R  *°(f,  0  +  R1*1  (t,  r)]  and  thus 

R  (r,  r)  =R(a,  c, )  +  (r  -  c,)2  Rli0(c„  c,)  +  (r-  c,)2[R2’°(ci,  c.)  +  R  l,l(e/.  c,)] 

+  |(r-c,)3  /"( int)  (2-2) 

6 


where  int  is  a  point  in  between  t  and  c,  (depending  on  both).  Substituting  (2.1)  and  (2.2)  into  (1.10)  and 
regrouping  terms  using  the  symmetry  of  R(f,  s)  we  obtain 

MSEn  =  E[I  (JX)  —  /2»(/X)]2  (23) 


=  £  R(Ci,  cM-trAti  J/2  +  /,  |/(r)/(2c, -t)dt  -(J/)2  • 

i-i  L  ^  4  J 


+  IR'-Vi.  c,-) 


Ar,  J  (r  -  cdfHt)  dt  -  2( J /)  J  (/  -  c,)/  (r)  dt 

A  A  At  j 


±E°n'°  (2.3.1) 


Ae\-°  (2.3.2) 


+  £  R2-°(c„  c,-)  j^-Ar,  J  (r  -  Ci)2f2(t)dt  +  ^Ar,  J  (r  -  c,)2f(t)f  (2c,  -  t)dt 
i-i  Lz  A  z  A 
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-(//)  j(t-ct)2f(Odt 

Ai  Ai  J 


(2.3.3) 


+  £  /?1-,(ci.ci)|^A/I  j  (r-c,)2/2(r)dr-|  Ar,  j  (t-d)2f(t)f(2ci-t)dt 
«= i  A  z  A 


(J  (r-c,)/(r)dr)2 
A 


(2.3.4) 


~Ern  (2.3.5) 


(2.3.6) 


+  £  ~k  At‘  J  -  c*)3/2(0^(  int)  dr 

«=1  ^  A, 

+  £  {  y  Ar,-  J  / (r)/ (2c, i  - r)[  |(r  -  c,)3  /?3-°(u„  2c,  -  u()  +  |(c,  -  r)3  /?°-3(u, ,2c,  -  «,) 

+  y(r  -  c,)2(c,  -  t)R2,l(u,  2c, ■  -  m,)  +  y(r  -  c,)(c,-  -  r)2  R  1,2(«<,  2c,  -  u,)  ] dr 
-  J  J  /  (0/  (s)[  -g-  (f  -  c,)3/?3-°(  int )  +  y(s  -  c,)3  R°\  int ) 

+  y  (r  -  c,)2(s  -  c,-) /?2,1(  int )  +  y(r  -  c,)(,s,  -  c,)2  /?  1,2(  int )  ]  dtds  j  ^E3 

In  (2.3.5),  int  denotes  a  point  in  Ai  depending  on  r.  In  the  first  term  of  (2.3.6),  R  (r,  2c,  -  r)  is  expanded 
about  R(c,  c, )  and  thus  the  point  u,  is  in  between  r  and  c,,  and  in  the  second  term  the  intermediate  point 
"  int "  is  in  between  (r,  s )  and  (c,-,  c,)  and  we  excluded  the  diagonal  t  =  s  from  the  integration  because  it 
has  zero  Lebesgue  measure. 

We  now  use  the  Taylor  expansion  of  /  (r)  about  c„  which  in  view  of  Assumption  A  .(i)  has  the  fotm 


/ (0  =/(c.)  +  (r  - c,)/'(c,)  +  y(r  -  c,)2/"[d(r.  c,)] 


(2.4) 


and  the  intermediate  point  d  depends  on  r  and  c,. 

Terms  involving  R(clt  c.)  (g2’°).  From  (2.4)  we  find 

J /=  Ar,/(c,)  +  Ar3  yj-  /"( int  j ) , 


A 


J/2  =  Ar,  f\a)  +  Ar?  -jy  (/(c,)/"(  int2)  + 1 fXd)\2 ) 


(2.5) 
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+  Arf  ±  /'(c,)[/"(  int4)  -fX  int3>]  +  Ar?  ^  [/"( int2)]2  , 


(2.6) 


J /(0/C 2c, -t)dt  =  AtJ2(c,)  +  Atf  - int,)  -  [/'(c,)]2} 


+  A fXcdU'X  int 3)  -/"( int4>]  +  Ar?  ^  /"( int5)/"(2c,  -  int5) , 


(2.7) 


and  substituting  into  (2.3.1)  we  obtain 


£2-0  =  £*(c„c,) Atf~ 

«» l  04 


~  (T( int2)]2  +  int5)/"(2c,  -  intj)  -  ±  [/"(  int,))2  i  •  (2.8) 


Using 


—  =  f  h(t)dt  =  h(  int)A/,- 

n  f 


(2.9) 


it  follows  from  (2.8)  by  Riemann  integrability  that  as  n 

l 

J 

o 


(2.10) 


Terms  involving  R  1-°(ci,  c,)  (£^°).  From  (2.4)  we  find 

J  (t  -  Cj)f  (O dt  =  At}  /'(c,)  +  Arf  -^-[/"(int4)-/"(int3)] , 

A,‘ 

J(t-c,)/2(0*  =  A/?  ^/(c,)/'(c,)  +  Ar?  / (c,)[/"(  int4)  -/"( int3)] , 
1  6  64 


(2.11) 


K 


+  Arf  -^/'(c,)/"(int6)  +  A/?  — ^  {[/"( intg)]2  -  (/"( int3)J2) , 


(2.12) 


and  substituting  them,  along  with  (2.5),  into  (2.3.2)  we  obtain 


£t.°=  £/?«.o(Ci.,Ci) 

i-i 


Arf  /'(c,)[| /"( int6)  -/"( int ,)]  +  o(«‘7) 


where  the  term  o(«“7)  is  uniform  in  i  since  by  (2.9),  A/,  £  (en)-1  for  some  e  >  0  as  h  is  bounded  away 


-i 
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(2.13) 


Terms  involving  /?2,0(c,-,  c.)  (£^°).  From  (2.4)  we  find 
J(f-c,)2/(/)d/  =  A/?  -jy/(c,)  +  A/?  -^/"(int6). 


(2.14) 


\{t-Ci)2f\t)dt=  At]  ~fif2(Ci)+  Al*  i\f\ci))2  +f(Ci)f"(  int6)} 


+  Arf  ^-/'(c,)[/"(intn)-/''(int10]-f-  At]  lT( int9)]2  . 

J  (f  -  c,)2/ (0/ (2c,  -t)dt-  At2  -jy  fZ(* )  +  Arf  -^-{/ (c,)/"(  int6)  -  [/'(c,)]2 ) 

+  A/f  /'(c<)(/"( intjo) -/"( intj,)]  +  At]  -p^-/"( int12)/"(2c,  -  intI2), 
and  substituting  into  (2.3.3)  along  with  (2.5)  we  obtain 

El'°  =  £  /?2-°(c„  c,) *  A/f  ~ /(c,)[y/"(  int6)  -  y/"(  int,)]  +  o(n^)| 
where  the  o(n~ 6)  term  is  uniform  in  i.  It  follows  from  (2.9)  that  as  n  -» <», 


(2.15) 


(2.16) 


(2.17) 


Terms  involving  R 111  (c,,  c.)  (E^1).  Substituting  (2.1 1),  (2.15)  and  (2.16)  into  (2.3.4)  we  obtain 


Ey  =  £  Rl'\ch  c,)|a/?  ^[/’(C,)]2  +o(n-*) 


where  the  o(n“°)  term  is  uniform  in  /,  and  thus  by  (2.9),  as  n  ->  », 

(lnsi  Eu  _>  _L  f  ^lSL  Ot/XQ}2 
(2«)  E„ 


(2.18) 


Terms  involving  /"  {Erm).  Substituting  (2.4)  into  (2.35)  we  obtain 

» 

E\  =  -jy  £  •  A/f  “  int 2)  -  r"'(  int  1 )]  +  o  (n-5) 
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where  the  o(n~5)  term  is  uniform  in  i,  and  using  (2.9)  we  find  a sn  -><*>, 

(2*)4  E\  ->  — l—  j  [/"«)  ~  r"'(t)]dt  =  0  .  (2.19) 

q  n  ( t ) 

Terms  involving  R3,0,R2%1  (El).  When  we  substitute  (2.4)  into  (2.3.6)  the  dominant  tenn 
corresponds  to  /  (c,).  We  thus  have 

it/2(c,)  +  o(l)]x  (2.20) 

<=•1 


x« 


7  Af(-  |  (r - c,)3[y /?3-°(u„  2c, R °’3  («„  2c, - u,) 


-  R2j(u„  2c,  -«,)  +  /?  '^(u,,  2c,  -  u,)]  dt 


(2.20.1) 


-  /  /  [\  (t  -  c,)3  /?3  0(  int )  +  -  c,)3  tf0*3  ( int ) 


+  y(f  -  c,)2(j  -  c.)  z?2-1  ( int )  +  y(r  -  c,)(j  -  c,)2  /?  1,2(  int  )]dtds 


(2.20.2) 


where  <?(1)  is  uniform  in  i.  In  view  of  the  possible  discontinuity  of  the  third  order  mixed  partial 
derivatives  of  R  at  the  diagonal  (Assumption  A.  («)),  we  proceed  as  follows.  For  the  first  term  (2.20.1)  in 

Ci  ti 

(2.20),  we  split  the  integral  J  into  the  two  parts  J  +  J  and  then  apply  the  mean  value  theorem  to  each 

A  <i-i  ci 

part,  since  ( t  -  c,)3  has  constant  sign  over  each  half-interval  (r,_i ,  c,),  (c,,  r,),  to  obtain,  e.g., 

c,  1. 

j(t-c,)3  R3,0(u,'  2c,  -u,)dt  =R3-°(ai)  j  (t -q)3  dt  +  R3’°(bO  j  (t  -  c,)3  dt 

K  *i-i  c< 

=  -j—  A/f[-^3,0(at) +  /?3,0(^i)] , 

64 

where  a\  and  b\  denote  intermediate  points  in  A,  xA,  above  and  below  its  diagonal  respectively.  (For 
R(t,  s ),  above  the  diagonal  means  t  <s,  and  below  the  diagonal  means  t  >  s).  For  the  second  term 
(2.20.2)  of  (2.20)  we  split  each  of  the  four  double  integrals  J  J  into  four  parts  corresponding  to  the 


-20- 


regions  above  and  below  the  diagonal  where  the  terms  (t  -  c,)3,  (s  -  c,)3,  (t  -  Cj)2(s  -  c,),  (t  -  ct)(s  -  c,)2 
have  constant  sign,  and  then  apply  the  mean  value  theorem.  For  instance  we  get 


jj  (f-c,)3  R2,0  ( int  )dt  ds='  jj  +  JJ  +  JJ  +  JJ  •  (r  —  c,)3  R3'°(  int)  dt  ds 

A,A,  i<s<tj  c, <i <1  <t,  <j <t  ti.x<j<i<ci  _ 

<t<c,  c,<t<t , 

-  &  +  *“<**?  *5  650 

=  tx>  ^|-9SJ-0(a5)  +  RJ-°(<l6)  +  9/!3'”(45)-*3-°(»6)| 


where  a*  and  6*  denote  intermediate  points  above  and  below  the  diagonal  in  Ai  x  /l,  respectively. 
Proceeding  likewise  for  the  remaining  terms  we  obtain 


£2  =  Z[f2(Ci)  +  o(l)]Af?x 
i-t 


1 


256 


-  j  /?3>0(a,)  +  j  R*‘°(.b ,)  +  j  /?°-3(a2)  -  j  R°'\b2 ) 


+  *^fl3)-K2*k*3)-*w(«4>  +  *u(*4) 


-  -g^  [-  9K3-°(a5)  +  K3-°(a6)  +  9 K3-°(fc5)  - /?3-°tf>6)] 

-  [ - £0,3(a7)  +  9K0,3(ag)  +  K°'3(67) - 9 K°'3(f>8)] 
-j^[-3R2’\a9)  +  7R2'\aM)  +  3R2'\b9)-lR2-\bl0) 


1 


2-960 


[  -  7  /?  l-2(<2 1 1 )  -4-  3  /?  12(a  12)  +  7  /? 1 2(^  u )  -  3  /? 1  2(fc  12)  ] 


Using  (2.9)  and  (1.12)  we  obtain  as  n 

1  A 

dtJ 

1 

0 


i  2 

(2/t)4£3-»  16j<a^j-^g-  -  j^3.o(0  +  y^o.3(0+/52.i(0-^t.2(0 
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By  (l.ll)  we  have  by  the  symmetry  of  R  that  Rik(t,  t)  =  R&(t,  t)  and  thus  f3j,k(t)  =  ~Pk,j{t)  for 
j  +  k  =  3.  Hence 

(2«)4  El  -»  J  ^  [3/J,o(0  +  7^2., (t)]  *  .  (2.21) 

The  final  result  follows  from  the  expression  (2.3)  of  the  mean  square  error  and  from  the  asymptotics  of  its 
terms  derived  in  (2.10),  (2.13),  (2.17),  (2.18),  (2.19)  and  (2.21).  □ 

PROOF  OF  COROLLARY  1.  In  the  weakly  stationary  case  Part  (ii).  of  Assumption  A  reduces  to 
Part  (ii)  of  Assumption  A',  while  Part  (mi)  is  automatically  satisfied.  Also  in  this  case  /?3>o (t)  =  -03  and 
/J2ti(r)  =  /}3,  and  thus  Corollary  1  follows  from  Theorem  2.  □ 

PROOF  OF  THEOREM  3.  The  proof  proceeds  along  lines  similar  to  the  proof  of  Theorem  2. 
Since  by  Assumption  B  (ii),  R(t,  s )  has  continuous  mixed  partial  derivatives  of  order  four,  its  Taylor 
expansion  (2.1)  has  the  point  (c„  c,)  in  place  of  the  intermediate  point  int  in  the  terms  of  order  three  and 
in  addition  it  has  the  following  fourth  order  terms: 

~  (t  -  c,)4  R4-°(  int )  +  \(t  -  c,)3(s  -  c,)*31  ( int )  +  \(t  -  c,)2(i  -  c')2K2’2(  int ) 

24  0  4 

+  j(t  -  Ci)(s  -  c,)3  R  u(  int )  +  -^(s  -  c,)4  R°'\  int )  (2.1)' 

where  int  is  again  a  point  in  the  open  line  segment  determined  by  (/,  s)  and  (c,  c,)  and  depends  on  (r,  s). 
Also  for  the  Taylor  expansion  of  r(t)  =  R  (t,  t )  in  (2.2)  we  now  have 

r(3>(r)  =  2[/?3’°(r,  t)  +  3R2'l(t,  r)]  ,  r(4>(r)  =  2[/?4-°(r,  t)  +  4/?31(r,  t)  +  3 R2'2(t,  r)] ,  (2.22) 

and  the  third  order  term  in  (2.2)  is  modified  as  follows  and  a  fourth  order  term  is  added: 

7(f  -  c.y’r^V.)  +  Tj(f  -  c,)4  r(4)(v,)  (2.2)' 

6  24 

where  v,  is  a  point  in  between  t  and  c„  depending  on  t.  When  (2.1)  -  (2.1)'  and  (2.2)  -  (2.2)'  are  now 
substituted  into  (1.1),  the  resulting  expression  for  the  mean-square  error  is  given  by 
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MSE„  =  £°-°  +  £»*°  +  £2.0  +£U  +  (£jy  +  (£2)'  +  £4 

where  the  first  four  terms  on  the  right  side  are  given  by  (2.3.1)  -  (2.3.4),  (£')'  and  (£3)'  are  modifications 
of  (2.3.5)  and  (2.3.6)  respectively  and  E%  is  a  new  term.  We  have 

(Erny  =  £  -p-WrOfo)  J(f  -  Ci?f\t)dt  +  7  J(f  -  C,)4  f\t)r«\vt)dt  [ .  (2.3.5)' 

1  U  l  A  *  A  J 

the  term  (£3)'  modifies  E\  of  (2.3.6)  and  simplies  to 

(£3)'=  -£/?3-°(c„c,)7  JJ  [(t-af+is-CifVWfWdtds, 

<=t  0  44 

-  £  £2’1(ci,  c.)  |jj[(r- c,)2(j  - c.)  +  (r -  c,)(5  - c,)2]/(0/(5)* .  (2.3.6)' 

i-l  2  44 

because  the  term  in  (2.3.6)  involving  J  is  identically  zero  once  £•'•*(«,,  2c,  -«,)  is  replaced  by 
Rj,k(cit  d).  The  additional  fourth  order  term  £4  is  given  by 

E\  =  £  j  A r,  j  / (/)/ (2c,  -  t)(t  - c,)4  £4’0  -  -g- Z?3*1  +  jR2-2  ~  * 1,3  +  ^  R°A  1  («*•  2c<  "  “«)* 


+  -kf  -  c,)(j  -  c,)3  /?  l-3(  int )  +  7-(J  -  c,)4  R°'\  int )]  dt  ds  (2.3.7) 

6  24 


where  u,  is  a  point  in  between  t  and  c,  and  "  int "  is  a  point  in  the  open  line  segment  of  (5,  t)  and  (c,,  c,). 
The  asymptotics  of  the  terms  (2.3.1)  -  (2.3.4)  are  given  in  (2.10),  (2.13),  (2.17)  and  (2.18). 

The  term  (£')'.  From  the  Taylor  expansion  (2.4)  of  /we  find 

J (*  -  c,)3/2(f)<*  =  Ar?  -^/(c,)/'(c,)  +  o(n'5) . 

J(*  -  Ci)4f2(t)dt  =  Arf  -^/2(c,)  +  o(«’5) , 
where  the  o(n~5)  terms  are  uniform  in  i.  Substituting  into  (2.3.5)'  we  obtain 
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;»i 


960 


V  (cdficdr^ia)  +  |  /2(ct)  r«  ( int ) 


+  o(0 


and  using  (2.9)  we  have  as  n 


(2n)5  (Ern)'^  ±  j  [2/  (t)f’(t)rW(t)  +  j/2(f)r(4)(01  7—  . 


hHt) 


(2.23) 


The  tenn  (El)',  Using  (2.5),  (2.1 1),  (2.14)  and 


J  (t  -  c,)3 / (/) df  =  A/f  /'(a)  +  At?  —■  (f"(  int u)  -/"( int  ,0)] .  (2.24) 


768 


we  obtain 


•  -  £  /?2,1(c,  c.) |A/f  / (c()/'(c,)  +  o(n*) 


where  the  o  (/1-6)  terms  are  uniform  in  i.  It  then  follows  by  (2.9)  that  as  n 

1 

0  h=(t) 


{In)5  {Ely  -*  -  J  -4  *3>0('.  0  +  4 ft2-1  (f.  0 


15 


dt 


(2.25) 


The  term  £4.  When  the  Taylor  expansion  (2.4)  of  /  is  substituted  in  (2.3.7),  the  dominant  term 
corresponds  to  /  (c,)  and  we  obtain 


^  =  ZL/z(c,)  +  o(l)]x 


i=i 


7 M  j  (r-c,y 
1 


R*’°  -  - /?3>1  +  -7  2’2  -  4- /? **3  +  7- /?0’4 
24  6  4  6  24 


{u„  2c j  -  ut)  dt 


(2.26) 


(2.26.1) 


7J 


77  (f  -  c,>4/?4  0(  int  )  +  ~r  (j  —  c,)4  fl04(  int )  +  7  (r  -  c,)2  (j  -  c,)2  2-2(  int  ) 
24  24  4 


JJ 


X(r  -  c,)3  (s  -  c,)/?3  ,(  int )  +  -j(f  -  c,)  (s  -  c,)3  /?I  3(  int ) 
6  6 


dsds 


(2.26.2) 


(2.26.3) 


where  o(l)  is  uniform  in  i.  Applying  the  mean  value  theorem,  we  find  that  the  term  (2.26.1)  equals 

Atf  *  —  I”/? 4,0  +/?0'4  -  4)? 3,1  -4/?1,3  +  6/?2*2!  (intj).  (2.26.1/ 

2*o(J*24  L  J 

and  the  term  (2.26.2)  equals 

-  Atf  -~1--  [3/?4-°(int2)  +  3/?0’4  (int3)+  10fl2*2(  int4)l  .  (2.26.2/ 

j'oU'Z4  L 

In  order  to  apply  the  mean  value  theorem  in  the  terms  of  (2.26.3),  the  square  A,  x  A,  is  split  into  its  four 
squares  with  half  size  over  each  of  which  (t  -  c,)(s  -  c,)  has  constant  sign.  We  thus  find 

j  j(t  -c,-)3  (s  -C/)/?3,1(  'mi)dtds  = 


Arf  ■  R*’\ int5)  -  /?3-‘(  inte)  +  /?3,,(  int7) -Rx\  int8) 


(2.27) 


where  int,  are  points  in  A,  x  A,,  and  likewise  for  the  other  term.  It  then  follows  from  (2.26),  (2.27)  and 


(2.9)  that  as  n 


(2 n)5  Eah  2s  Jdt  4^- j  ,,  J—  [tf4'°  +R°'a  -4/?3,1  -4/?1-3  +6/?2’2l(r,  t ) 

"  l  h\t)  2-80-24  l  J 

'  3^24  [3/?4’°  +  3/?0'4  +  10/?2,2](f’  0 


i  2  r 

=  f4^~  /?4-°(f,r)  +  4«3'1(r,r)  +  |l?2'2f1f)  dt 

60  £  A5(r)  [  3 


(2.28) 


Finally,  adding  (2.10),  (2.13),  (2.17),  (2.18),  (2.23),  (2.25)  and  (2.28),  we  obtain  (1.17)  and  (1.18). 


The  final  compact  expression  in  (1.18)  follows  by  straightforward  calculation.  □ 
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Fractional  mean-square  error  mse(N)la2  as  a  function  of  the  sample  size  N. 
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